A quantitative assessment of the Hadoop framework for analyzing massively parallel DNA sequencing data
Abstract
[This corrects the article DOI: 10.1186/s13742-015-0058-5.]
Related articles
Local Alignment Tool Based on Hadoop Framework and GPU Architecture
With the rapid growth of next generation sequencing technologies, such as Slex, more and more data have been discovered and published. To analyze such huge data the computational performance is an important issue. Recently, many tools, such as SOAP, have been implemented on Hadoop and GPU parallel computing architectures. BLASTP is an important tool, implemented on GPU architectures, for biolog...
Cloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming
The objective of this study is to verify the importance of the capabilities of cloud computing services in managing and analyzing big data in business organizations because the rapid development in the use of information technology in general and network technology in particular, has led to the trend of many organizations to make their applications available for use via electronic platforms hos...
Next generation sequencing is here now.
The availability of massively parallel DNA sequencers has brought the cost of sequencing genes to affordable levels but the cost of analyzing the huge amount of data has not decreased to the same extent. Thus, only analyzing the sequences of the genes relevant to the patient's condition makes the cost manageable. A panel of genes relevant to lymphedematous conditions is described.
mtDNA-Server: next-generation sequencing data analysis of human mitochondrial DNA in the cloud
Next generation sequencing (NGS) allows investigating mitochondrial DNA (mtDNA) characteristics such as heteroplasmy (i.e. intra-individual sequence variation) to a higher level of detail. While several pipelines for analyzing heteroplasmies exist, issues in usability, accuracy of results and interpreting final data limit their usage. Here we present mtDNA-Server, a scalable web server for the ...
Hadoop-BAM: directly manipulating next generation sequencing data in the cloud
Hadoop-BAM is a novel library for the scalable manipulation of aligned next-generation sequencing data in the Hadoop distributed computing framework. It acts as an integration layer between analysis applications and BAM files that are processed using Hadoop. Hadoop-BAM solves the issues related to BAM data access by presenting a convenient API for implementing map and reduce functions that can ...